.Net: Introduce support for response modalities and audio options in AzureClientCore
#12523
base: main
Conversation
Introduce methods to handle response modalities and audio options in AzureClientCore. Add checks for executionSettings.Modalities and executionSettings.Audio to dynamically configure options based on user settings. Implement GetResponseModalities and GetAudioOptions methods to support various input formats, improving flexibility and robustness.
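The helper logic described above might look roughly like the following sketch. This is illustrative only, not the PR's exact code: it assumes the OpenAI .NET SDK's `ChatResponseModalities` flags enum, and the string-parsing branch is an assumed input format.

```csharp
// Illustrative sketch only, not the PR's exact implementation.
// Assumes the OpenAI .NET SDK's ChatResponseModalities flags enum.
private static ChatResponseModalities GetResponseModalities(object modalities)
{
    switch (modalities)
    {
        case ChatResponseModalities flags:
            // Already the SDK type; pass it through unchanged.
            return flags;

        case string value:
            // Accept inputs such as "text", "audio", or "text,audio".
            ChatResponseModalities result = default;
            foreach (var part in value.Split(','))
            {
                result |= part.Trim().ToLowerInvariant() switch
                {
                    "text" => ChatResponseModalities.Text,
                    "audio" => ChatResponseModalities.Audio,
                    _ => throw new NotSupportedException(
                        $"Unsupported response modality: '{part.Trim()}'.")
                };
            }
            return result;

        default:
            throw new NotSupportedException(
                $"Unsupported type for Modalities: '{modalities.GetType()}'.");
    }
}
```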
@microsoft-github-policy-service agree

This is for request #11720.
@Cobra86 Thank you for adding this support. To get it merged, we also need unit tests similar to the ones we have for the OpenAI connector. Please add those. Overall LGTM.
Implemented tests in `AzureOpenAIChatCompletionServiceTests` to verify the correct handling of audio content in requests and responses. This includes checks for sending audio content, processing audio responses, and handling audio metadata. Introduced new theory data members for validating response modalities and audio options.
@RogerBarreto Thank you :). I've added the tests, same as for OpenAI. Please let me know if I need more tests.
dotnet/src/Connectors/Connectors.AzureOpenAI/Core/AzureClientCore.ChatCompletion.cs
Looks good, just a couple small comments about error messaging, to help users troubleshoot failures.
Enhance error handling by introducing try-catch blocks for JSON deserialization, providing clearer exception messages for unsupported modalities and invalid audio options. Refactor parsing logic for string modalities to improve code readability and maintainability.
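The error-handling pattern described in that commit can be sketched as follows. Names and the assumption that `ChatAudioOptions` round-trips through `JsonSerializer` are illustrative; the actual PR code may differ.

```csharp
// Illustrative sketch: wrap JSON deserialization so that a malformed Audio
// payload produces an actionable error instead of a raw JsonException.
// Assumes audio options arrive as a JsonElement in execution settings.
private static ChatAudioOptions GetAudioOptions(JsonElement audioElement)
{
    try
    {
        var options = JsonSerializer.Deserialize<ChatAudioOptions>(audioElement.GetRawText());
        return options
            ?? throw new ArgumentException("Audio options deserialized to null.");
    }
    catch (JsonException ex)
    {
        // Surface the offending payload to help users troubleshoot failures.
        throw new ArgumentException(
            $"Invalid audio options: '{audioElement.GetRawText()}'. " +
            "Expected a JSON object such as {\"voice\":\"alloy\",\"format\":\"mp3\"}.", ex);
    }
}
```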
Thanks. I've made the changes. Please review it and let me know.
Motivation and Context
This change extends `AzureClientCore` to handle response modalities and audio options dynamically based on user-provided `executionSettings`.

Why this is needed:
Currently, the OpenAI connector supports audio modalities through the `Modalities` and `Audio` properties in `OpenAIPromptExecutionSettings`, but the Azure OpenAI connector doesn't fully implement this functionality. The code for handling audio exists in `AzureClientCore.cs` but isn't included in `AzureClientCore.ChatCompletion.cs`.
Description

Introduced `GetResponseModalities` and `GetAudioOptions` helper methods. These follow the same logic as the equivalent methods in the OpenAI connector to ensure consistent behaviour and reduce duplication across both clients.

Updated `CreateChatCompletionOptions` to:
- Set `ResponseModalities` if specified in `executionSettings`.
- Set `AudioOptions` if specified in `executionSettings`.
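With those options wired up, a caller might exercise the feature roughly like this. The type and member names mirror the OpenAI connector's public surface (`ChatResponseModalities`, `ChatAudioOptions`); treat their availability on the Azure settings type as an assumption of this sketch.

```csharp
// Hypothetical usage sketch: request both text and audio output from the
// Azure OpenAI chat completion service via prompt execution settings.
var settings = new AzureOpenAIPromptExecutionSettings
{
    Modalities = ChatResponseModalities.Text | ChatResponseModalities.Audio,
    Audio = new ChatAudioOptions(ChatOutputAudioVoice.Alloy, ChatOutputAudioFormat.Mp3),
};

var result = await chatService.GetChatMessageContentAsync(chatHistory, settings);
```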
Contribution Checklist